Abstract

We show that the Entropy Photon-Number Inequality (EPnI) holds when one of the input states is the vacuum state, for several candidates of the other input state. These include the cases where the state has the number states as its eigenvectors and either has only two non-zero eigenvalues or has an arbitrary number of non-zero eigenvalues but is a high-entropy state. We also discuss conditions which, if satisfied, would lead to an extension of these results.

1 Introduction 

The Entropy Photon-Number Inequality (EPnI) was conjectured by Guha et al. [1]. The EPnI has a classical analogue called the entropy power inequality, which is stated as follows. Let $X$ and $Y$ be independent random variables with densities, and let $h(X)$ be the differential entropy of $X$. Then

$$e^{2h(X)} + e^{2h(Y)} \leq e^{2h(X+Y)} \tag{1}$$

holds. It was first stated by Shannon in Ref. [2], and the proof was given by Stam and Blachman [3, 4].

The EPnI has some important consequences in quantum information theory. In particular, if this conjecture is true, then one would be able to establish the classical capacity of certain bosonic channels [1, 5]. The EPnI is shown to imply two minimum output entropy conjectures, which would suffice to prove the capacity of several other channels such as the thermal noise channel [5] and the bosonic broadcast channel [6, 7].

The statement of the inequality is as follows. Let $a$ and $b$ be the photon annihilation operators, and let the joint state of the modes associated with $a$ and $b$ be the product state $\rho_{AB} = \rho_A \otimes \rho_B$, where $\rho_A$ and $\rho_B$ are the density operators associated with the $a$ and $b$ modes, respectively. For a beam splitter with inputs $a$ and $b$, output $c$, transmissivity $\eta$ and reflectivity $1 - \eta$, the annihilation operator evolution is given by

$$c = \sqrt{\eta}\, a + \sqrt{1-\eta}\, b. \tag{2}$$
The EPnI is now stated as

$$g^{-1}[S(\rho_C)] \geq \eta\, g^{-1}[S(\rho_A)] + (1-\eta)\, g^{-1}[S(\rho_B)], \tag{3}$$

where

$$g(x) = (x+1)\log(x+1) - x\log(x) \tag{4}$$

is the von Neumann entropy of the thermal state with mean photon number $x$, and $S(\rho) = -\mathrm{Tr}(\rho \log \rho)$ is the von Neumann entropy.
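
The quantities in (3) and (4) are easy to evaluate numerically. The following Python sketch is illustrative only: the function names `g` and `g_inv` are hypothetical, entropies are measured in nats, and $g^{-1}$ is obtained by bisection, which assumes only that $g$ is strictly increasing on $[0, \infty)$.

```python
import math

def g(x: float) -> float:
    """Eq. (4): entropy (in nats) of a thermal state with mean photon number x >= 0."""
    if x == 0.0:
        return 0.0                      # limit of -x*log(x) as x -> 0
    return (x + 1.0) * math.log(x + 1.0) - x * math.log(x)

def g_inv(y: float, tol: float = 1e-12) -> float:
    """Numerical inverse of g by bisection (an assumption of this sketch)."""
    if y <= 0.0:
        return 0.0
    lo, hi = 0.0, 1.0
    while g(hi) < y:                    # enlarge the bracket until it contains the root
        hi *= 2.0
    while hi - lo > tol * max(1.0, hi):
        mid = 0.5 * (lo + hi)
        if g(mid) < y:
            lo = mid
        else:
            hi = mid
    return 0.5 * (lo + hi)

# Round trip: g_inv undoes g up to the bisection tolerance.
print(g(1.0), g_inv(g(2.5)))
```

Bisection is chosen here for robustness rather than speed; any monotone root finder would serve equally well.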

In this paper, we prove the EPnI for the case where $\rho_B$ is the vacuum state and $\rho_A$ has the number states as its eigenvectors and either has two nonzero eigenvalues or has high von Neumann entropy with an arbitrary number of nonzero eigenvalues. There are other candidates as well for which the EPnI holds in some special cases, and these are mentioned later.

2 The beam-splitter transformation 

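We obtain the output density matrix $\rho_C$ from the beam-splitter transformation. The annihilation operators for the two outputs are

$$c = \sqrt{\eta}\, a + \sqrt{1-\eta}\, b, \tag{5}$$

$$d = e^{\mathrm{i}\phi}\left(\sqrt{1-\eta}\, a - \sqrt{\eta}\, b\right), \tag{6}$$

where $[a, a^\dagger] = [b, b^\dagger] = [c, c^\dagger] = [d, d^\dagger] = I$, $[a, b] = [a, c] = [a, d] = 0$, and so on. We assume that the input density operators are diagonal in the number state basis and hence

$$\rho_{AB} = \sum_{i=0}^{\infty}\sum_{j=0}^{\infty} x_i y_j\, |i\rangle_A |j\rangle_B \langle i|_A \langle j|_B, \tag{7}$$

where $x_i$ and $y_j$ are the $i$th and $j$th eigenvalues of $A$ and $B$, respectively, and $|i\rangle_A$ and $|j\rangle_B$ are the Fock number states for the systems $A$ and $B$, respectively. Any state $|i\rangle_A |j\rangle_B$ can be written as (see Ref. [8], for example)

$$|i\rangle_A |j\rangle_B = \frac{(a^\dagger)^i (b^\dagger)^j}{\sqrt{i!\,j!}}\, |0\rangle_A |0\rangle_B. \tag{8}$$

From (5) and (6), we get $a^\dagger = \sqrt{\eta}\, c^\dagger + \sqrt{1-\eta}\, e^{\mathrm{i}\phi} d^\dagger$ and $b^\dagger = \sqrt{1-\eta}\, c^\dagger - \sqrt{\eta}\, e^{\mathrm{i}\phi} d^\dagger$. Using these with (8), we get the transformation

$$|i\rangle_A |j\rangle_B \xrightarrow{\text{B.S.}} \frac{1}{\sqrt{i!\,j!}} \left(\sqrt{\eta}\, c^\dagger + \sqrt{1-\eta}\, e^{\mathrm{i}\phi} d^\dagger\right)^{i} \left(\sqrt{1-\eta}\, c^\dagger - \sqrt{\eta}\, e^{\mathrm{i}\phi} d^\dagger\right)^{j} |0\rangle_C |0\rangle_D, \tag{9}$$

where B.S. indicates the action of the beam splitter. Using the fact that the operators $c^\dagger$ and $d^\dagger$ commute and the binomial expansion, we get

$$|i\rangle_A |j\rangle_B \xrightarrow{\text{B.S.}} \frac{1}{\sqrt{i!\,j!}} \sum_{k=0}^{i}\sum_{l=0}^{j} \binom{i}{k}\binom{j}{l} (-1)^{l} e^{\mathrm{i}\phi(k+l)}\, \eta^{\frac{i-k+l}{2}} (1-\eta)^{\frac{j-l+k}{2}}\, (c^\dagger)^{(i+j)-(k+l)} (d^\dagger)^{k+l}\, |0\rangle_C |0\rangle_D. \tag{10}$$

Incorporating the action of $c^\dagger$ and $d^\dagger$ on the vacuum states of $C$ and $D$, we get

$$|i\rangle_A |j\rangle_B \xrightarrow{\text{B.S.}} \frac{1}{\sqrt{i!\,j!}} \sum_{k=0}^{i}\sum_{l=0}^{j} \binom{i}{k}\binom{j}{l} (-1)^{l} e^{\mathrm{i}\phi(k+l)}\, \eta^{\frac{i-k+l}{2}} (1-\eta)^{\frac{j-l+k}{2}} \sqrt{[(i+j)-(k+l)]!\,(k+l)!}\; |(i+j)-(k+l)\rangle_C\, |k+l\rangle_D. \tag{11}$$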
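Hence, we arrive at the expression for $\rho_{CD}$ as

$$\begin{aligned}
\rho_{CD} = {} & \sum_{i=0}^{\infty}\sum_{j=0}^{\infty} \frac{x_i y_j}{i!\,j!} \sum_{k=0}^{i}\sum_{l=0}^{j}\sum_{k'=0}^{i}\sum_{l'=0}^{j} e^{\mathrm{i}\phi(k+l-k'-l')} (-1)^{l+l'} \binom{i}{k}\binom{j}{l}\binom{i}{k'}\binom{j}{l'} \\
& \times \eta^{\,i-\frac{k+k'}{2}+\frac{l+l'}{2}} (1-\eta)^{\,j-\frac{l+l'}{2}+\frac{k+k'}{2}} \sqrt{[(i+j)-(k+l)]!\,(k+l)!}\,\sqrt{[(i+j)-(k'+l')]!\,(k'+l')!} \\
& \times |(i+j)-(k+l)\rangle_C\, |k+l\rangle_D\, \langle (i+j)-(k'+l')|_C\, \langle k'+l'|_D.
\end{aligned} \tag{12}$$

Now, tracing out system $D$, we get

$$\begin{aligned}
\rho_{C} = {} & \sum_{i=0}^{\infty}\sum_{j=0}^{\infty} \frac{x_i y_j}{i!\,j!} \sum_{k=0}^{i}\sum_{l=0}^{j}\sum_{k'=0}^{i}\sum_{l'=0}^{j} (-1)^{l+l'} \binom{i}{k}\binom{j}{l}\binom{i}{k'}\binom{j}{l'}\, \eta^{\,i-\frac{k+k'}{2}+\frac{l+l'}{2}} (1-\eta)^{\,j-\frac{l+l'}{2}+\frac{k+k'}{2}} \\
& \times [(i+j)-(k+l)]!\,(k+l)!\; |(i+j)-(k+l)\rangle_C \langle (i+j)-(k+l)|_C\; \delta_{k+l,\,k'+l'}.
\end{aligned} \tag{13}$$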

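We now consider the special case when $\rho_B$ is a vacuum state. Let the set of all probability vectors (with infinite length) be denoted by $\mathcal{P}$, so that if $x \in \mathcal{P}$, then $\sum_{i=0}^{\infty} x_i = 1$ and $x_i \geq 0$ $\forall\, i \geq 0$. Then (13) reduces to

$$\rho_C = \sum_{i=0}^{\infty} z_i\, |i\rangle_C \langle i|_C, \tag{14}$$

where $z = M_\eta(x) = M(\eta, x)$, and $M : [0,1] \times \mathcal{P} \to \mathcal{P}$ is a transformation given by

$$z_i = \sum_{j=i}^{\infty} \binom{j}{i} \eta^{i} (1-\eta)^{j-i} x_j. \tag{15}$$

Hence, since $S(\rho_B) = 0$ for the vacuum state, (3) reduces to

$$g^{-1}\{H[M_\eta(x)]\} \geq \eta\, g^{-1}[H(x)], \tag{16}$$

where $H$ denotes the Shannon entropy of a probability vector. Note that this equation is expected to hold for all $x \in \mathcal{P}$ and $\eta \in [0,1]$. The inequality is trivially true for $\eta = 0$, since $M_0(x) = [1, 0, \ldots]$ implies $H[M_0(x)] = 0$, and for $\eta = 1$, since $M_1(x) = x$.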
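
As an illustration of (15) and (16), the following Python sketch applies $M_\eta$ to a finitely supported probability vector and evaluates both sides of (16) for a two-point input. The names `thin`, `shannon`, `g`, and `g_inv` are hypothetical, entropies are in nats, and `g_inv` inverts (4) by bisection; this is a numerical spot check under those assumptions, not a proof.

```python
import math

def g(x):
    """Eq. (4): entropy (nats) of a thermal state with mean photon number x."""
    return 0.0 if x == 0 else (x + 1) * math.log(x + 1) - x * math.log(x)

def g_inv(y, tol=1e-12):
    """Invert g by bisection (g is strictly increasing on [0, inf))."""
    if y <= 0:
        return 0.0
    lo, hi = 0.0, 1.0
    while g(hi) < y:                      # enlarge the bracket
        hi *= 2.0
    while hi - lo > tol * max(1.0, hi):
        mid = 0.5 * (lo + hi)
        lo, hi = (mid, hi) if g(mid) < y else (lo, mid)
    return 0.5 * (lo + hi)

def shannon(p):
    """Shannon entropy (nats) of a probability vector."""
    return -sum(q * math.log(q) for q in p if q > 0)

def thin(eta, x):
    """Eq. (15): z_i = sum_{j>=i} C(j,i) eta^i (1-eta)^(j-i) x_j."""
    n = len(x)
    return [sum(math.comb(j, i) * eta**i * (1 - eta)**(j - i) * x[j]
                for j in range(i, n)) for i in range(n)]

x = [0.3, 0.7]                            # two-point input distribution
eta = 0.5
z = thin(eta, x)
lhs = g_inv(shannon(z))                   # left-hand side of (16)
rhs = eta * g_inv(shannon(x))             # right-hand side of (16)
print(z, lhs, rhs)
```

The truncation to a finite vector is exact here because $M_\eta$ never moves probability mass to a higher photon number.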

3 $\rho_A$ is two-dimensional in the number state basis and $\rho_B$ is the vacuum state

Let

$$H_b(p) = -p\log(p) - (1-p)\log(1-p) \tag{17}$$

be the binary entropy of a two-point probability distribution $[p, 1-p]$ with $0 \leq p \leq 1$. Let the eigenvalues of $\rho_A$ be given by the probability vector $x = [1-a, a, 0, \ldots]$. Then $M_\eta(x) = [1-\eta a, \eta a, 0, \ldots]$ and therefore $H(x) = H_b(a)$ and $H[M_\eta(x)] = H_b(\eta a)$. We now prove (16) for the above case.

Lemma 1. For all $\eta \in [0,1]$ and $a \in [0,1]$, we have

$$g^{-1}[H_b(\eta a)] \geq \eta\, g^{-1}[H_b(a)], \tag{18}$$

with equality if and only if $\eta \in \{0, 1\}$ or $a = 0$.

Proof. One can see that $g^{-1}[H_b(\eta a)] = \eta\, g^{-1}[H_b(a)]$ if $\eta \in \{0, 1\}$ or $a = 0$. In all other cases, we show that

$$g^{-1}[H_b(\eta a)] > \eta\, g^{-1}[H_b(a)]. \tag{19}$$

Let $f(\beta) = g^{-1}[H_b(\beta)]$. The Lemma is equivalent to showing that $f(\beta)/\beta$ is a strictly decreasing function in $0 < \beta < 1$. Note that $g(\beta) = H_b(\beta) + 2\,[\log(2) - H_b(1/2 + \beta/2)]$ and $\log(2) > H_b(1/2 + \beta/2)$ for all $\beta \in (0,1)$; hence $g(\beta) > H_b(\beta)$ for all $0 < \beta < 1$. Since $g$ is one-to-one and increasing, we have $g^{-1}[H_b(\beta)] < \beta$, i.e., $f(\beta) < \beta$, for all $0 < \beta < 1$.

It is not difficult to see that

$$\frac{d}{d\beta}\,\frac{f(\beta)}{\beta} = \frac{\log\{(1-\beta)[1+f(\beta)]\}}{\beta^{2}\,\log\left[\dfrac{1+f(\beta)}{f(\beta)}\right]}, \tag{20}$$

and since, using $f(\beta) < \beta$ for all $0 < \beta < 1$, it follows that $(1-\beta)[1+f(\beta)] < 1$ for all $0 < \beta < 1$, $f(\beta)/\beta$ is a strictly decreasing function in $0 < \beta < 1$. □
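
The two facts used in the proof, $f(\beta) < \beta$ and the monotonicity of $f(\beta)/\beta$, can be spot-checked numerically. The sketch below is illustrative only: the helper names are hypothetical, entropies are in nats, $g^{-1}$ is computed by bisection, and the check runs on a coarse grid inside $(0,1)$ rather than establishing the claim.

```python
import math

def g(x):
    """Eq. (4) in nats."""
    return 0.0 if x == 0 else (x + 1) * math.log(x + 1) - x * math.log(x)

def g_inv(y, tol=1e-13):
    """Invert g by bisection (g is strictly increasing)."""
    if y <= 0:
        return 0.0
    lo, hi = 0.0, 1.0
    while g(hi) < y:
        hi *= 2.0
    while hi - lo > tol * max(1.0, hi):
        mid = 0.5 * (lo + hi)
        lo, hi = (mid, hi) if g(mid) < y else (lo, mid)
    return 0.5 * (lo + hi)

def h_b(p):
    """Binary entropy of Eq. (17) in nats."""
    if p in (0.0, 1.0):
        return 0.0
    return -p * math.log(p) - (1 - p) * math.log(1 - p)

def f(beta):
    """f(beta) = g^{-1}[H_b(beta)] as in the proof of Lemma 1."""
    return g_inv(h_b(beta))

betas = [k / 20 for k in range(1, 20)]            # grid inside (0, 1)
ratios = [f(b) / b for b in betas]
decreasing = all(r1 > r2 for r1, r2 in zip(ratios, ratios[1:]))
below_identity = all(f(b) < b for b in betas)
print(decreasing, below_identity)
```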

Recall that if the distribution of a random variable $X$ is Binomial, denoted by $\mathrm{Bin}(L, \eta) \in \mathcal{P}$, then $\mathrm{Bin}(L, \eta, k) = \Pr\{X = k\} = \binom{L}{k}\eta^{k}(1-\eta)^{L-k}$ if $k \in \{0, 1, \ldots, L\}$ and is zero otherwise.

Let the two non-zero entries of the probability vector $x^{N,P}$ be at the $N$-th and $P$-th positions, i.e., $x_N = 1-a$, $x_P = a$, and let $z^{N,P} = M_\eta(x^{N,P})$.

Lemma 2. For all $\eta \in [0,1]$, $a \in [0,1]$ and $L > 1$, we have

$$g^{-1}[H(z^{N,P})] \geq \eta\, g^{-1}[H(x^{N,P})]. \tag{21}$$

Proof. By Lemma 1, we have

$$g^{-1}[H_b(\eta a)] \geq \eta\, g^{-1}[H_b(a)]. \tag{22}$$

Note that $g$ is one-to-one and strictly increasing, and therefore $g^{-1}$ is also strictly increasing. Therefore, it is enough to prove that

$$H(z^{N,P}) \geq H(z^{0,1}), \tag{23}$$

as $H(z^{0,1}) = H_b(\eta a)$ and $H(x^{N,P}) = H_b(a)$. We first show that

$$H(z^{0,P}) \geq H(z^{0,1}). \tag{24}$$

Note that

$$H(z^{0,P}) = f[a, (1-\eta)^{P}] + a\, H[\mathrm{Bin}(P, \eta)], \tag{25}$$

where

$$f(a, x) = -[(1-a) + ax]\log[(1-a) + ax] - (1-x)\, a\log(a) + x\log(x)\, a. \tag{26}$$

It is not difficult to show that $f(a, x)$ is a decreasing function of $x$, so that $f[a, (1-\eta)^{P}]$ increases with $P$. Note that $H[\mathrm{Bin}(P, \eta)]$ also increases with $P$. Since $H(z^{0,P})$ is a sum of two functions each of which increases with $P$, (24) follows.

Next, we show that for all $N, P \geq 0$, we have

$$H(z^{N+1,P+1}) \geq H(z^{N,P}). \tag{27}$$

Note first that $\mathrm{Bin}(N+1, \eta) = (1-\eta)\,\mathrm{Bin}(N, \eta) + \eta\,\mathrm{Bin}_{+1}(N, \eta)$, where if $X$ has distribution $\mathrm{Bin}_{+1}(N, \eta)$, then $\Pr\{X = k+1\} = \mathrm{Bin}(N, \eta, k)$ $\forall\, k$. This implies that

$$z^{N+1,P+1} = (1-\eta)\, z^{N,P} + \eta\, z^{N,P}_{+1}, \tag{28}$$

where we define $z^{N,P}_{+1}$ similarly. Using $H(z^{N,P}) = H(z^{N,P}_{+1})$, it is not difficult to show that

$$H(z^{N+1,P+1}) = H(z^{N,P}) + (1-\eta)\, D\left[z^{N,P} \,\|\, z^{N+1,P+1}\right] + \eta\, D\left[z^{N,P}_{+1} \,\|\, z^{N+1,P+1}\right], \tag{29}$$

where $D(\cdot\|\cdot)$ is the relative entropy, which is always non-negative, and hence (27) follows. Assume w.l.o.g. that $P > N$. Applying (27) repeatedly followed by (24), we get

$$H(z^{N,P}) \geq H(z^{0,P-N}) \geq H(z^{0,1}). \tag{30}$$

The result follows. □
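
The chain of entropy inequalities used in this proof can be checked on a concrete instance. The following sketch is illustrative only: the helper names `thin`, `shannon`, and `two_point` are hypothetical, entropies are in nats, and the parameter values are arbitrary choices with $P > N$.

```python
import math

def shannon(p):
    """Shannon entropy (nats) of a probability vector."""
    return -sum(q * math.log(q) for q in p if q > 0)

def thin(eta, x):
    """Eq. (15) applied to a finitely supported probability vector."""
    n = len(x)
    return [sum(math.comb(j, i) * eta**i * (1 - eta)**(j - i) * x[j]
                for j in range(i, n)) for i in range(n)]

def two_point(n, p, a):
    """x^{N,P}: mass 1-a at position N and mass a at position P (N != P)."""
    x = [0.0] * (max(n, p) + 1)
    x[n], x[p] = 1.0 - a, a
    return x

eta, a, N, P = 0.6, 0.3, 2, 5
H_NP = shannon(thin(eta, two_point(N, P, a)))        # H(z^{N,P})
H_0d = shannon(thin(eta, two_point(0, P - N, a)))    # H(z^{0,P-N})
H_01 = shannon(thin(eta, two_point(0, 1, a)))        # H(z^{0,1}) = H_b(eta*a)
print(H_NP, H_0d, H_01)
```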



4 $\rho_A$ has number states as eigenvectors and $\rho_B$ is the vacuum state

We have observed that the EPnI holds when $\rho_A$ has two non-zero eigenvalues with the number states as eigenvectors and $\rho_B$ is a vacuum state. We now consider the case when $\rho_A$ has the number states as its eigenvectors and could have an arbitrary number of nonzero eigenvalues, and $\rho_B$ is the vacuum state. We derive some necessary and sufficient conditions for this inequality to hold.

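We first note that $M_\eta[M_{\eta'}(x)] = M_{\eta\eta'}(x)$ $\forall\, \eta, \eta' \in [0,1]$ and $x \in \mathcal{P}$. To prove this, let $y = M_{\eta'}(x)$, $z = M_\eta(y)$ and note that

$$\begin{aligned}
z_i &= \sum_{k=i}^{\infty} \binom{k}{i} \eta^{i} (1-\eta)^{k-i} y_k && (31) \\
&= \sum_{j=i}^{\infty} \binom{j}{i} (\eta\eta')^{i}\, x_j \sum_{k-i=0}^{j-i} \binom{j-i}{k-i} [\eta'(1-\eta)]^{k-i} (1-\eta')^{j-k} && (32) \\
&= \sum_{j=i}^{\infty} \binom{j}{i} (\eta\eta')^{i}\, [\eta'(1-\eta) + (1-\eta')]^{j-i}\, x_j && (33) \\
&= \sum_{j=i}^{\infty} \binom{j}{i} (\eta\eta')^{i} (1-\eta\eta')^{j-i}\, x_j. && (34)
\end{aligned}$$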
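
The composition rule $M_\eta[M_{\eta'}(x)] = M_{\eta\eta'}(x)$ derived in (31)-(34) can be confirmed numerically on a small example. The sketch below is illustrative only; `thin` is a hypothetical name for the map in (15), and the input vector and transmissivities are arbitrary.

```python
import math

def thin(eta, x):
    """Eq. (15) applied to a finitely supported probability vector."""
    n = len(x)
    return [sum(math.comb(j, i) * eta**i * (1 - eta)**(j - i) * x[j]
                for j in range(i, n)) for i in range(n)]

x = [0.1, 0.2, 0.3, 0.4]
eta, eta_p = 0.7, 0.4

composed = thin(eta, thin(eta_p, x))      # M_eta[M_eta'(x)]
direct = thin(eta * eta_p, x)             # M_{eta*eta'}(x)
max_gap = max(abs(u - v) for u, v in zip(composed, direct))
print(max_gap)                            # expected to be at rounding-error level
```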



To simplify the notation, let us define

$$H(\eta, x) = H(M_\eta x), \tag{35}$$

$$h(\eta, x) = g^{-1}[H(\eta, x)]. \tag{36}$$

As $M_1$ is an identity transformation, we sometimes write $H(x)$ for $H(1, x)$ and $h(x)$ for $h(1, x)$. Note that $h(1, x) = g^{-1}[H(x)]$ and therefore, (16) can be rephrased as

$$\frac{h(\eta, x)}{\eta} \geq h(1, x). \tag{37}$$



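It is not difficult to see that if (16) holds, then $h(\eta, x)/\eta$ is a decreasing function of $\eta$. To see this, let $\eta' < \eta$ and $\delta = \eta'/\eta$, where $0 < \delta < 1$. Since $M_{\eta'} = M_\delta M_\eta$, we have

$$\begin{aligned}
\frac{h(\eta', x)}{\eta'} &= \frac{h[\delta, M_\eta(x)]}{\delta}\,\frac{1}{\eta} && (38) \\
&\geq \frac{h[1, M_\eta(x)]}{\eta} = \frac{h(\eta, x)}{\eta}. && (39)
\end{aligned}$$

As $h(\eta, x)/\eta$ is differentiable, we have

$$\frac{d}{d\eta}\,\frac{h(\eta, x)}{\eta} \leq 0, \tag{40}$$

where

$$\frac{d}{d\eta}\,\frac{h(\eta, x)}{\eta} = \frac{\eta\,\dfrac{dH(\eta, x)}{d\eta} - H(\eta, x) + \log[1 + h(\eta, x)]}{\eta^{2}\,\log\left[\dfrac{1 + h(\eta, x)}{h(\eta, x)}\right]}. \tag{41}$$

Lemma 3. Let $M_\eta : [0,1] \times \mathcal{P} \to \mathcal{P}$ be the transformation given by (15). The following are equivalent:

$$\text{(i)} \quad h(\eta, x) \geq \eta\, h(1, x) \qquad \forall\, x \in \mathcal{P},\ \forall\, \eta \in (0,1], \tag{42}$$

$$\text{(ii)} \quad \frac{d}{d\eta}\,\frac{h(\eta, x)}{\eta} \leq 0 \qquad \forall\, x \in \mathcal{P},\ \forall\, \eta \in (0,1], \tag{43}$$

$$\text{(iii)} \quad \left.\frac{d}{d\eta}\,\frac{h(\eta, x)}{\eta}\right|_{\eta=1} \leq 0 \qquad \forall\, x \in \mathcal{P}. \tag{44}$$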



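Proof. It is clear from (40) that (i) and (ii) are equivalent. Furthermore, (ii) implies (iii) since (iii) is a special case of (ii). We prove that (iii) implies (ii). Note that

$$\begin{aligned}
\left.\frac{d}{d\beta}\,\frac{h[\beta, M_\eta(x)]}{\beta}\right|_{\beta=1} &= \left.\frac{d}{d\beta}\,\frac{h(\eta\beta, x)}{\beta}\right|_{\beta=1} && (45) \\
&= \eta^{2} \left.\frac{d}{d\theta}\,\frac{h(\theta, x)}{\theta}\right|_{\theta=\eta}. && (46)
\end{aligned}$$

Now (iii) implies that

$$\left.\frac{d}{d\beta}\,\frac{h[\beta, M_\eta(x)]}{\beta}\right|_{\beta=1} \leq 0, \tag{47}$$

and hence, (ii) follows using (46). □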



We now state the EPnI in (16) in the form of an entropic inequality, i.e., an inequality involving Shannon entropy of discrete probability distributions. By Lemma 3, (16) is equivalent to

$$H(\eta, x) - \eta\,\frac{dH(\eta, x)}{d\eta} \geq \log[1 + h(\eta, x)]. \tag{48}$$

The above can be expressed as

$$g\left(e^{H(\eta, x) - \eta\, dH(\eta, x)/d\eta} - 1\right) \geq H(\eta, x). \tag{49}$$

Note that $g(1/\beta - 1) = H_b(\beta)/\beta$ $\forall\, \beta \in (0,1]$ and hence, (49) is equivalent to showing that

$$H(\eta, x) \leq \frac{H_b\left(e^{-H(\eta, x) + \eta\, dH(\eta, x)/d\eta}\right)}{e^{-H(\eta, x) + \eta\, dH(\eta, x)/d\eta}}. \tag{50}$$



For the two-dimensional case with $\eta = 1$, $x = [a, 1-a, 0, \ldots]$, $a \in [0,1]$, we have $H(\eta, x) - \eta\, dH(\eta, x)/d\eta = -\log(a)$ and $H(x) = H_b(a)$, and substituting this in (50), we get

$$H_b(a) \leq \frac{H_b(a)}{a}, \tag{51}$$



which is true. This gives a short proof of (16) for this special case. Evaluating (50) at $\eta = 1$ gives an interesting expression that depends only on the distribution $x$. It is shown in (62) that

$$\Theta(x) = \left.\frac{dH(\eta, x)}{d\eta}\right|_{\eta=1} = -\sum_{i=1}^{\infty} i\, x_i \log\left(\frac{x_i}{x_{i-1}}\right), \tag{52}$$

and hence, (50) reduces to

$$H(x) \leq \frac{H_b\left(e^{-H(x) + \Theta(x)}\right)}{e^{-H(x) + \Theta(x)}}. \tag{53}$$

The above inequality involves only entropies and another function $\Theta$ of the distribution but, to the best of our knowledge, has never been studied before in the literature.
We now show that if (16) is true, then it implies that

$$\eta\,\frac{dH(\eta, x)}{d\eta} \leq 1, \tag{54}$$

$$\eta\,\frac{dH(\eta, x)}{d\eta} \leq H(\eta, x). \tag{55}$$

If (16) holds, then using Lemma 3, we have $H(\eta, x) - \eta\, dH(\eta, x)/d\eta \geq \log[1 + h(\eta, x)]$. As $\log[1 + h(\eta, x)] \geq 0$, we have $H(\eta, x) - \eta\, dH(\eta, x)/d\eta \geq 0$, which proves (55).

Using Lemma 3 again, we have $\eta\, dH(\eta, x)/d\eta - H(\eta, x) + \log[1 + h(\eta, x)] \leq 0$. It is enough to prove that $H(\eta, x) - \log[1 + h(\eta, x)] \leq 1$, i.e.,

$$1 + g^{-1}[H(\eta, x)] \geq e^{H(\eta, x) - 1}. \tag{56}$$

We first consider the case when $0 \leq H(\eta, x) \leq 1$. Then $e^{H(\eta, x) - 1} \leq 1$. Therefore, $1 + g^{-1}[H(\eta, x)] \geq 1 \geq e^{H(\eta, x) - 1}$ and (56) holds.

Now consider $H(\eta, x) > 1$. Hence, it is enough to prove that $1 + g^{-1}(x) \geq e^{x-1}$ $\forall\, x > 1$, or, $x + 1 \geq g(e^{x} - 1)$ $\forall\, x > 0$. Simplifying, we can show that this is equivalent to showing that $r(e^{-x}) \geq 0$, where $r : [0,1] \to \mathbb{R}$ and

$$r(x) = x + (1-x)\log(1-x). \tag{57}$$

Note that $r(0) = 0$ and $dr(x)/dx = -\log(1-x) \geq 0$ $\forall\, x \in [0,1]$. Therefore, $r(x) \geq 0$ $\forall\, x \in [0,1]$ and (56) follows.
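
One way to carry out the simplification that leads to (57) is sketched below. It uses only the definition of $g$ in (4) and the identity $\log(e^{x} - 1) = x + \log(1 - e^{-x})$; the intermediate steps are a reconstruction and are not spelled out in the text above.

```latex
\begin{align*}
g(e^{x} - 1) &= x\,e^{x} - (e^{x} - 1)\log(e^{x} - 1) \\
             &= x\,e^{x} - (e^{x} - 1)\left[x + \log(1 - e^{-x})\right] \\
             &= x - (e^{x} - 1)\log(1 - e^{-x}), \\[4pt]
x + 1 - g(e^{x} - 1) &= 1 + (e^{x} - 1)\log(1 - e^{-x}) \\
             &= e^{x}\left[e^{-x} + (1 - e^{-x})\log(1 - e^{-x})\right] \\
             &= e^{x}\, r(e^{-x}).
\end{align*}
```

Since $e^{x} > 0$, the inequality $x + 1 \geq g(e^{x} - 1)$ holds exactly when $r(e^{-x}) \geq 0$.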

Conditions (54) and (55) are necessary for (16) to hold. We now show that they both hold under general conditions.



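Lemma 4. For all $\eta \in [0,1]$ and $x \in \mathcal{P}$, the following hold:

$$\eta\,\frac{dH(\eta, x)}{d\eta} < 1, \tag{58}$$

$$\eta\,\frac{dH(\eta, x)}{d\eta} \leq H(\eta, x), \tag{59}$$

with equality in (59) if and only if $M_\eta(x) = [1, 0, \ldots]$.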
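Proof. Let $z = M_\eta(x)$. Using

$$\eta\,\frac{dz_i}{d\eta} = i z_i - (i+1) z_{i+1}, \tag{60}$$

we get

$$\begin{aligned}
\eta\,\frac{dH(\eta, x)}{d\eta} &= -\sum_{i=0}^{\infty} \eta\,\frac{dz_i}{d\eta}\,\log(z_i) && (61) \\
&= -\sum_{i=1}^{\infty} i z_i \log\left(\frac{z_i}{z_{i-1}}\right) && (62) \\
&\overset{a}{\leq} \sum_{i=1}^{\infty} i\,(z_{i-1} - z_i) && (63) \\
&= 1, && (64)
\end{aligned}$$

where in a, we have used the inequality $\log(x) \geq 1 - 1/x$ for all $x > 0$, with equality if and only if $x = 1$. If $z$ is such that $z_i \neq 0$ $\forall\, i$, then it is impossible to have an equality in a, since equality would imply $z_{i-1} = z_i$ $\forall\, i$, and this would imply that $\sum_{i=0}^{\infty} z_i$ is unbounded.

If $z$ has a finite number of nonzero values, say $z = [z_0, z_1, \ldots, z_{L-1}, 0, \ldots]$, then (64) can be further tightened as

$$\eta\,\frac{dH(\eta, x)}{d\eta} \leq 1 - L z_{L-1}. \tag{65}$$

Hence, (58) holds.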

We now prove (59) or, equivalently,

$$\Theta(z) = -\sum_{i=1}^{\infty} i z_i \log\left(\frac{z_i}{z_{i-1}}\right) \leq H(z). \tag{66}$$

Let us define a sequence of probability distributions $\{z^{(L)}\}$, $L = 0, 1, \ldots$, where $z^{(L)}$ has length $L + 1$, $z^{(L)} = [(1 - z_L)\, z^{(L-1)}, z_L]$ and $z^{(0)} = [1]$. It is easy to see that the following recurrence relations hold:

$$\Theta(z^{(L)}) = (1 - z_L)\,\Theta(z^{(L-1)}) + L z_L \log\left(\frac{(1 - z_L)\, z_{L-1}}{z_L}\right), \tag{67}$$

$$H(z^{(L)}) = (1 - z_L)\, H(z^{(L-1)}) + H_b(z_L). \tag{68}$$

Define

$$\Xi(z^{(L)}) = \Theta(z^{(L)}) - H(z^{(L)}). \tag{69}$$

Using the recurrence relations in (67) and (68), we get

$$\Xi(z^{(L)}) = (1 - z_L)\,\Xi(z^{(L-1)}) + L z_L \log\left(\frac{(1 - z_L)\, z_{L-1}}{z_L}\right) - H_b(z_L). \tag{70}$$

We now claim that

$$\Xi(z^{(L)}) \leq L \log(1 - z_L). \tag{71}$$

We prove this by induction. It is easy to check that $\Xi(z^{(1)}) = \log(1 - z_1)$. Let (71) hold for $L - 1$, $L > 1$. Then we have

$$\begin{aligned}
\Xi(z^{(L)}) &= (1 - z_L)\,\Xi(z^{(L-1)}) + L z_L \log\left(\frac{(1 - z_L)\, z_{L-1}}{z_L}\right) - H_b(z_L) && (72) \\
&\overset{a}{\leq} (L-1)(1 - z_L)\log(1 - z_{L-1}) + (L-1)\, z_L \log(z_{L-1}) + L z_L \log\left(\frac{1 - z_L}{z_L}\right) - H_b(z_L) && (73) \\
&\overset{b}{=} -(L-1)\, d(z_L, z_{L-1}) + L \log(1 - z_L) && (74) \\
&\leq L \log(1 - z_L), && (75)
\end{aligned}$$

where in a, we have used the induction hypothesis and the fact that $z_L \log(z_{L-1}) \leq 0$, and in b,

$$d(x, y) = x \log\left(\frac{x}{y}\right) + (1 - x)\log\left(\frac{1 - x}{1 - y}\right) \tag{76}$$

is the relative entropy between $[x, 1-x]$ and $[y, 1-y]$ and is always nonnegative. (59) now follows from (71) since $\log(1 - z_L) \leq 0$. The equality condition follows straightforwardly. □
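
The identities and bounds of Lemma 4 can be spot-checked numerically. The sketch below is illustrative only: it compares $\Theta(z)$ from (62) with a central finite-difference estimate of $\eta\, dH/d\eta$, and then checks the bounds (59) and (65). The names `thin`, `shannon`, and `theta`, the step size, and the test vector are assumptions of this sketch; entropies are in nats.

```python
import math

def shannon(p):
    """Shannon entropy (nats) of a probability vector."""
    return -sum(q * math.log(q) for q in p if q > 0)

def thin(eta, x):
    """Eq. (15) applied to a finitely supported probability vector."""
    n = len(x)
    return [sum(math.comb(j, i) * eta**i * (1 - eta)**(j - i) * x[j]
                for j in range(i, n)) for i in range(n)]

def theta(z):
    """Theta(z) = -sum_{i>=1} i z_i log(z_i / z_{i-1}); assumes no interior zeros."""
    return -sum(i * z[i] * math.log(z[i] / z[i - 1])
                for i in range(1, len(z)) if z[i] > 0)

x = [0.1, 0.2, 0.3, 0.4]
eta, step = 0.6, 1e-6
z = thin(eta, x)

# Central finite-difference estimate of eta * dH(eta, x)/d(eta).
fd = eta * (shannon(thin(eta + step, x)) - shannon(thin(eta - step, x))) / (2 * step)

L = len(z)                                # z = [z_0, ..., z_{L-1}]
print(theta(z), fd, shannon(z), 1 - L * z[-1])
```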

It is not difficult to see that a sufficient condition for (16) to hold is that $dH(\eta, x)/d\eta \leq 0$. This condition is, of course, not true for many distributions, such as a distribution whose sequence of entries is non-increasing. Suppose $z = M_\eta(x)$ has some zero entries in its interior, i.e., $z_i = 0$ and $z_{i+1} \neq 0$ for some $i$. Then one can easily check that $dH(\eta, x)/d\eta = -\infty$ and (16) holds. It also follows from (65) that (16) holds for distributions with finitely many non-zero entries of the form $z = [z_0, z_1, \ldots, z_{L-1}, 0, \ldots]$ with $z_{L-1} \geq 1/L$.

We now show that (16) holds if $H(x)$ is sufficiently large.

Lemma 5. For a given $\eta \in (0,1)$ and $x \in \mathcal{P}$, (16) holds if $H(x)$ is large enough.

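Proof. Using (49), we need to show that

$$g\left(e^{H(\eta, x) - \eta\, dH(\eta, x)/d\eta} - 1\right) \geq H(\eta, x). \tag{77}$$

We have

$$\begin{aligned}
g\left(e^{H(\eta, x) - \eta\, dH(\eta, x)/d\eta} - 1\right) &\overset{a}{\geq} H(\eta, x) + \delta - e^{-H(\eta, x) + \eta\, dH(\eta, x)/d\eta} && (78) \\
&\overset{b}{\geq} H(\eta, x) + \delta - e^{1 - H(\eta, x)} && (79) \\
&\geq H(\eta, x), && (80)
\end{aligned}$$

where in a, we use the inequality $g(e^{x} - 1) \geq x + 1 - e^{-x}$ and we use Lemma 4 to get $\eta\, dH(\eta, x)/d\eta \leq 1 - \delta$ for some $\delta > 0$; in b, we use $\eta\, dH(\eta, x)/d\eta \leq 1$; and the last inequality would hold if $H(\eta, x) \geq 1 - \log(\delta)$, i.e., if $H(\eta, x)$ is large enough.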

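We now show that if $H(x)$ is large, then so is $H(\eta, x)$ for $\eta \in (0,1)$. Define

$$q(\eta, x) = \frac{H(\eta, x)}{\eta}. \tag{81}$$

Differentiating w.r.t. $\eta$, we get, using (59),

$$\begin{aligned}
\frac{dq(\eta, x)}{d\eta} &= \frac{1}{\eta^{2}}\left[\eta\,\frac{dH(\eta, x)}{d\eta} - H(\eta, x)\right] && (82) \\
&\leq 0. && (83)
\end{aligned}$$

Hence, $q(\eta, x)$ is a decreasing function of $\eta$ and $H(\eta, x) \geq \eta H(x)$. Similarly, using (58), we get

$$\frac{d}{d\eta}\left[H(\eta, x) - \log(\eta)\right] = \frac{1}{\eta}\left[\eta\,\frac{dH(\eta, x)}{d\eta} - 1\right] \leq 0, \tag{84}$$

so that

$$H(\eta, x) \geq H(x) + \log(\eta). \tag{85}$$

Hence,

$$H(\eta, x) \geq \max\{\eta H(x),\, H(x) + \log(\eta)\}. \tag{86}$$

This shows that if $H(x)$ is large, then so is $H(\eta, x)$ and hence, (16) would hold for any $\eta \in (0,1]$ for large $H(x)$. □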
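
The lower bound (86) on the output entropy can be illustrated numerically. The following sketch is illustrative only: the helper names are hypothetical, entropies are in nats, and the uniform input on 20 number states is an arbitrary choice of a relatively high-entropy distribution.

```python
import math

def shannon(p):
    """Shannon entropy (nats) of a probability vector."""
    return -sum(q * math.log(q) for q in p if q > 0)

def thin(eta, x):
    """Eq. (15) applied to a finitely supported probability vector."""
    n = len(x)
    return [sum(math.comb(j, i) * eta**i * (1 - eta)**(j - i) * x[j]
                for j in range(i, n)) for i in range(n)]

x = [1 / 20] * 20                         # uniform over the first 20 number states
H_x = shannon(x)

results = []
for eta in (0.2, 0.5, 0.9):
    H_eta = shannon(thin(eta, x))                     # H(eta, x)
    bound = max(eta * H_x, H_x + math.log(eta))       # right-hand side of (86)
    results.append((eta, H_eta, bound))
    print(eta, H_eta, bound)
```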
5 Discussion

It is, of course, of great interest to see if these results could be generalized to the cases where $\rho_A$ and $\rho_B$ do not have the special structure such as the eigenvectors being the number states. It would seem that our results may extend to cover some of these cases if the following is established. Suppose there exists an $x \in \mathcal{P}$ such that

$$\left.\frac{d}{d\eta}\,\frac{h[\eta, M_\beta(x)]}{\eta}\right|_{\eta=1} < 0 \qquad \forall\, \beta \in (0,1]. \tag{87}$$

Then, it follows from (46) that

$$\left.\frac{d}{d\eta}\,\frac{h(\eta, x)}{\eta}\right|_{\eta=\beta} < 0 \qquad \forall\, \beta \in (0,1]. \tag{88}$$

This would then imply that $h(\beta, x)/\beta$ is a strictly decreasing function of $\beta \in (0,1]$ and hence, (16) holds with strict inequality.

An example of such an $x$ is $x = [a, 1-a, 0, \ldots]$, $a \neq 1$, for which $M_\beta(x) = [1 - (1-a)\beta, (1-a)\beta, 0, \ldots]$, and (87) is strict using (51).

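For finite $n$ and any state $\sigma$ defined on the number states as

$$\sigma = \sum_{i,j=0}^{n} \sigma_{i,j}\, |i\rangle\langle j|, \tag{89}$$

we define a function

$$f(n, \sigma) = \sum_{i,j=0}^{n} \sigma_{i,j}\, |e_i^{n}\rangle\langle e_j^{n}|, \tag{90}$$

where $\{|e_i^{n}\rangle\}$ is the standard basis for the Hilbert space of dimension $n + 1$, i.e., $\langle e_i^{n}| = [\underbrace{0, \ldots, 0}_{i}, 1, \underbrace{0, \ldots, 0}_{n-i}]$, $i = 0, 1, \ldots, n$. It follows that $S(\sigma) = S[f(n, \sigma)]$.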
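Now consider the input states such that

$$\rho_A = \sum_{i,j=0}^{n_A} \lambda_{i,j}\, |i\rangle_A \langle j|_A, \tag{91}$$

$$\rho_B = \sum_{i,j=0}^{n_B} \mu_{i,j}\, |i\rangle_B \langle j|_B, \tag{92}$$

$$\tilde{\rho}_A = a\, |0\rangle_A \langle 0|_A + (1-a)\, |1\rangle_A \langle 1|_A, \tag{93}$$

$$\tilde{\rho}_B = |0\rangle_B \langle 0|_B, \tag{94}$$

where $n_A, n_B$ are finite and $\|f(n_A, \rho_A) - f(n_A, \tilde{\rho}_A)\|_{\mathrm{tr}} \leq \delta$ and $\|f(n_B, \rho_B) - f(n_B, \tilde{\rho}_B)\|_{\mathrm{tr}} \leq \delta$.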

It is not difficult to see that under the action of $f$, the output $\rho_C$ of the beam splitter with $\rho_A$ and $\rho_B$ as inputs is close to the output $\tilde{\rho}_C$ with $\tilde{\rho}_A$ and $\tilde{\rho}_B$ as inputs, i.e., $\|f(n_A + n_B, \rho_C) - f(n_A + n_B, \tilde{\rho}_C)\|_{\mathrm{tr}} \leq \epsilon$, where we could make $\epsilon$ as small as possible by choosing $\delta$ small. Using Fannes' inequality [9, 10], this would result in a small deviation in the von Neumann entropies of $\rho_A$, $\rho_B$ and $\rho_C$ as compared to $\tilde{\rho}_A$, $\tilde{\rho}_B$ and $\tilde{\rho}_C$, respectively, that can be absorbed while still preserving the inequality, since the inequality is strict.